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Abstract. We show how coupling techniques can be used in some metastable 
systems to prove that mean metastable exit times are almost constant as functions 
of the starting microscopic configuration within a "metastable set". In the example 
of the Random Field Curie Weiss model, we show that these ideas can also be used 

^-j \ to prove asymptotic exponentiallity of normalized metastable escape times. 
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CL, 1 1. INTRODUCTION 

CD ! 

1.1. The problem. Metastable systems are characterized by the fact that the state 
\Q '. space can be decomposed into several disjoint subsets, with the property that tran- 

sition times between subspaces are long compared to characteristic mixing times 
within each subspace. The mathematically rigorous analysis of Markov processes 
exhibiting metastable behavior was first developed in the large deviation theory 
of Freidlin and Wentzell [71 [T4ll . This approach yields logarithmic asymptotics of 
transition times and other quantities of interest. Over the last decade, a potential 
theoretic approach H, ED to metastability was developed that in many instances 
yields more precise asymptotics, and in particular the exact prefactors of exponen- 
'. tial terms. 

In this work we study metastability for a class of stochastic Ising models. The 
main objective is to extend the potential theoretical approach for deriving asymp- 
totics of transition times for processes starting from individual microscopic config- 
urations, and, subsequently, for studying exponential scaling laws for these transi- 
tion times. 

So far the existing methods work well in the following situations: 
(1) the process is strongly recurrent in the sense that it visits an individual atom 
of the state space in each metastable state many times with overwhelming 
probability before a metastable transition happens. This situation occurs, 
e.g., in Markov chains with finite state space, and on discrete state space, 
such as Z d , in the presence of a confining potential. 
2) in models where strong symmetries allow the analysis of the dynamics 
through a lumped chain that satisfies the requirements of 1). This situation 
occurs, e.g., in mean field models such as the Curie-Weiss model lfl"2ll and 
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the Curie-Weiss model with random magnetic fields that take only finitely 
many values ITTT| [3H. 
3) in situations where the process returns often to small neighborhoods, O e (x) 
of points, x, in a metastable state where the oscillations of harmonic func- 
tions on these neighborhoods can be made arbitrarily small. This is the 
case in finite and some infinite dimensional diffusion processes [9l [5U . 

One would expect that the situation 3) also arises in a wide variety of stochastic 
Ising models or stochastic particle systems exhibiting metastable behaviour. Prov- 
ing the respective regularity properties of microscopic harmonic functions appears, 
however, to be a difficult issue in general. 

The purpose of the present paper is to develop an approach to this problem via 
coupling techniques that allow to cover at least some interesting situations. 

A key idea of the potential theoretic approach is to express quantities of physical 
interest in terms of capacities and to use variational principles to compute the latter. 
A fundamental identity used systematically in this approach is a representation 
formula for the Green's function, g B (x, y), with Dirichlet conditions in a set B, that 
reads (in the context of arbitrary discrete state space) 

9b{x, y) = n{y) hx f y m , (1.1) 
cap(x, B) 

where B is a subset of the configuration space, h XtB {y) = h{ x ^ B , and h AtB is the 
equilibrium potential, i.e. 

fl, iiyeA 
h A ,B(y)=\^ ifyeB (1.2) 

I F y (r A < r B ) , otherwise. 

We use 

t c = min {t > : x(t) G C} 

for the first hitting times of sets C, and cap(A, B) is the capacity between the sets 
A and B; cap(x, B) = cap({x} , B) . 

(II. ID immediately leads to a formula for the mean hitting time E x r B of B, for 
the process starting in x. However, the resulting expression for ~E x tb is useful as 
long as the ratio appearing in (II. ID is under control and is not seriously of the form 
0/0. 

To be more precise, it may happen that h x ^ B (y) = f (A) h A)B (y) and cap(x 1 B) = 
f(A)c&p(A } B), for "macroscopic" sets A 3 x. Then 

K, B {y) _ h A , B (y) ^ 



cap(x, B) cap(A, B) ' 

but except in cases where (II. 3D is manifest by some symmetry, it will be very 
hard to establish such relations by a direct point-wise estimation of numerator and 
denominator in (II. ID . 

Examples where this problem occurs are diffusion processes in d > 1, Glauber 
dynamics in the case of finite temperature, etc.. In such cases a useful version can 
be extracted by averaging Eq. (II. ID with respect to x after multiplying both sides 
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by cap(x, B) over as suitable neighborhood A = A x . This yields the formula 

where va is a specific probability distribution on A. Actually (11.41) can be derived 
without a recourse to (II. ID : If P is the transition kernel of a reversible Markov 
chain x(t), then the equlibruim potential }ia,b is harmonic outside A U B; (I — 

P)h A , B = Lh A)B = 0. Thus, 

h A , B (y) = y^ j g B {y,x)Lh A ,B{x) (1.5) 

x<EA 

for all y £ B. By reversibility fi(y)g B (y, x) = fi(x)g B (x, y), and it follows that 

n(y)h A ,B{y) = y^//(g)^A t B(g)E 3! rB, (1.6) 

which is (11.41) with ^(x) = /j,(x)Lh A ,B(x) /c&p(A, B). 

The point is that the right-hand side of (II .40 can be evaluated in many cases of 
interest when formula (II. ID suffers from the problem discussed above. This has 
been demonstrated recently in two examples, the Glauber dynamics of the random 
field Curie-Weiss model at finite temperature 0]], and the Kawasaki dynamics in 
the zero temperature limit on volumes that diverge exponentially with the inverse 
temperature [0. 

An obvious question is whether the mean hitting time of B really depends on 
the specific initial distribution v A or whether, for all z G A, E 2 r B is equal to E ua t b 
up to a small error. This question, and related one concerning other functions 
of initial conditions is of much further reaching importance. In particular, it is 
relevant for proving the asymptotic exponentiallity of the transition time using 
approximate renewal arguments. Let us mention that the same issue also arises in 
the case of diffusion equations in the Wentzell-Freidlin regime. Here Martinelli et al 
IfTOl [9tl showed that solutions of the stochastic differential equation starting at two 
different points in a neighbourhood of a stable equilibrium and driven by the same 
noise are converging exponentially fast to each other with probability tending 
to one. From this they deduced regularity of exit probabilities F x [r B > iEr B ] as 
functions of x and hence exponentiallity of t b and asymptotic independence of 
~E x r B of the starting point x e A. Such a strong contraction property is, however, 
not available in stochastic Ising models on the level of microscopic paths. 

In the present paper we will develop a method that allows to obtain similar re- 
sults, at least in some cases, with an alternative and, weaker input. It is based on 
coupling techniques and allows to turn the following simple heuristic argument 
into a rigorous proof: The Markov chain should mix quickly before it leaves a 
substantial neighborhood of the starting point x; since the mixing time is short 
compared to the hitting time r B , the mean of t b should be the same for all starting 
configuration in A. Moreover, the chain will return many times to A before reach- 
ing B; by rapid mixing, the return times will be essentially i.i.d., hence the number 
of returns will be geometric, and the scaled hitting time will be exponential. 

To demonstrate the usefulness of this approach, our key example will be the 
Random Field Curie-Weiss model with continuous distribution of the random fields. 
In that sense, the present result is also a completion of our previous paper 0]] . 
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Technically, the coupling construction we employ, is based on QSj and still contains 
model dependent elements. However, the basic ideas are more general and will be 
of relevance for the treatment of a wider range of metastable systems. 

The remainder of this paper is organized as follows. In the next subsection we 
describe a general setting of Markov chains to which our method applies. In Sub- 
section 1.3 we state our two main theorems. In Section 2 we recall the definition 
of Glauber dynamics for the random field Curie-Weiss model and recall the main 
result from [ 1 ] . In Section 3 we recall the coupling constructed by Levin et al for 
the standard Curie-Weiss model and show how this can be modified to be useful in 
the random field model. We then prove Theorem 11.11 In Section 4 we show how 
to prove the asymptotic exponentiallity of the transition times and give the proof 
of Theorem ll.2l 

1.2. Setting. In this subsection we describe a general setting in which our meth- 
ods can be applied. 

In the sequel iV will be a large parameter. We consider (families of) Markov 
processes, cr(t), with finite state space, S N = {—1, 1}^, and transition probabilities 
p N that are reversible w.r.t. a (Gibbs) measure, fi N . Transition probabilities p N 
always have the following structure: At each step a site x G A is chosen with 
uniform probability 1/N. Then the spin at x is set to ±1 with probabilities p^(a); 
Pi( a ) +Px( a ) = 1- m tri e sequel we shall assume that there exists a G [1/2, 1) such 
that 

maxp^(cr) < a. (1-7) 

A key hypothesis is the existence of a family of "good" mesoscopic approximations 
of our processes. By this we mean the following: There is a sequence of disjoint 
partitions, {A", . . . , A£ }, of A = {1, . . . , A^}, and a family of maps, : S N — > 
l '„ C R n , given by 

We will always think of these partitions as nested, i.e. |a" +1 , . . . , A^^j is a re- 
finement of {A™, . . . , A£ n }. On the other hand, to lighten the notation, we will 
mostly drop the superscript and identify k n = n, and refer to the generic partition 
Ai, . . . , A„. It will be convenient to introduce the notation 

S n [m] ee (m n ) -1 (m) = {a : m n {a) = m} 

for the set-valued inverse images of m n . We think of the maps m n as some block 
averages of our 'microscopic variables cr, ; over blocks of decreasing (in n) "meso- 
scopic" sizes. 

As is well known, the image process, m n (a(t)), is in general not Markovian. 
However, there is a canonical Markov process, m n (t), with state space T n and re- 
versible measure Q n = jj N o (m n ) _1 , that is a "best" approximation of m n (a(t)), 
in the sense that if m n (a(t)) is Markov, then m n (t) = m n (a(t)) (in law). For all 
m, m' G T n , the transition probabilities of this chain are given by 

r N (m,m/) = V" /i N (a)p N (a,a'). (1.9) 
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In the models we consider here the following two assumptions are satisfied: 

(A.l) the sequence of chains m n (t) approximates m n (a(t)) in the strong sense 
that there exists e(n) 4- 0, as n | oo, such that for any m, m' G T n , 



max 

(T6S™(ro],»'6S"[m'| 



p N {a,a')\S n {ml} 
r N (m, m') 



< e(n) 



(1-10) 



(A.2) The microscopic flip rates satisfy: If m n (a) = m n (r]) and a x = r] x , then 

=pt(v)- 

Note that our assumption (A.l) is much stronger then the maybe more natural 
looking 



max 

o-es n [m] 



r jv ( Wli El 



< e(n). 



Finally, we need to place us in a "metastable" situation. Specifically, we will 
assume that there exist two disjoint sets A = {a G S N : m n °(a) G A} and B = 
{a G S N : m n °(a) G B}, for some n and sets A,BC T no , a constant C > and a 
sequence a„ < oo, such that, for all n > n and for all a,r] G A, 



F<7 [TB < ^"(r?)] < One" 



-CN 



(1-11) 



where, with a little abuse of notation, we denote by T m n^ the first hitting time of 

the set S n [m n (r])]. 



1.3. Main results. In the setting outlined above we will prove the following theo- 
rem. 

Theorem 1.1. Consider a Markov process as described above, and let A,B be such 
that rfTTTD holds. Then, 

<e- CN ' 2 . (1.12) 

Remark. Assumptions (Al) and (A2) are formulated in the context in which we 
will prove our results. The restriction to the state space {—1,1}^ is mainly done 
because we need to construct an explicit coupling. It is rather straightforward to 
generalize everything to the case of Potts spins (S N = {1, . . . , q} N ) and maps m n 
whose components are permutation invariant functions of the spin variables on A". 

The claim of Theorem II .11 is trivial whenever ¥ a (T v < t b ) is exponentially close 
to one, as | oo. However, in the context of stochastic Ising models it is reason- 
able to expect that, for fixed a, 77 G A, P (7 (r J? < t b ) is exponentially small. That is, 
despite the fact that a chain starting at a spends an exponentially large amount of 
time in A, this time is not long enough for visiting more than a small fraction of the 
exponentially large number of microscopic points in A. An alternative approach 
is to try to construct a coupling between o and rj chains. In the case of the Curie- 
Weiss model (without random fields) a useful coupling algorithm was suggested in 
the recent paper [8]. This algorithm ensures that: 

(a) If m n (a s ) = m n (ri s ), then m n (a t ) = m n (r) t ) for all t > s. 

(b) The Hamming distance between a t and i] t is non-increasing in time. 



max 

<T,r)£A 



E„r B 
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In a way, this is reminiscent of the stochastic stability results of iTTOll . It is 
straightforward to adjust the construction of USD to the general context we con- 
sider here. But both (a) and (b) above would be lost, and it is not clear that such 
a coupling would work globally. 

Instead, our strategy is to use (II. IIP and to keep trying to couple the cr-chain 
with a typical 77-chain each time when a t enters S n (m(r])). In the sequel we call 
this the basic coupling attempt. Clearly, in view of a possible biased sampling, 
basic coupling attempts should be designed with care, which explains the relatively 
complicated construction in Subsection 13.21 It is based on fl8]], but we need to 
enlarge the probability space in order to achieve sufficient independence between 
decision making and properties of the eventually chosen 77-path. In particular, the 
fact that cr-chain and 77-chain meet will not automatically imply coupling. 

A second and related problem that tends to arise in the situation that we are 
interested in is the breakdown of strict renewal properties. This is a well-known 
issue in the theory of continuous space Markov processes where methods such 
as Nummelin splitting lfT3l were devised to prove ergodic theorem for the Harris 
recurrent chains. Here we would like to use renewal arguments e.g. to prove 
asymptotic exponentiallity of the law of t b . We will show that again coupling 
arguments can be used to solve such problems. 

As an example we will prove the following theorem. 

Theorem 1.2. In the random field Curie-Weiss model, for A and B chosen to satisfy 
the hypothesis of Theorem 

F a (r B /E a T B > t) ->• e _t , as N't 00, (1.13) 

for alia e A and for all t e R+. 

Theorem [L2] is proven in Section 3. The basic idea is to use our iterative coupling 
procedure for deriving a renewal-type equation for the Laplace transform of r B . 

2. The random field Curie-Weiss model 

The results of this paper are motivated by the study of the Glauber dynamics of 
the random field Curie-Weiss model (RFCW). We will show that the assumptions 
of the two theorems above can be verified in that model. In this section we briefly 
recall results for this model obtained recently in [ 1 ] and prove an elementary local 
recurrence estimate. 

2.1. The model and equilibrium properties. In the RFCW model the state space 
is Sn = {—1, 1}^, the Gibbs measure is given by 

/ijv(cr) = Z N l exp (-/3H N (a)) , (2.1) 

and the random Hamiltonian, H N , is defined as 

\ «eA / ieA 

where A = {1, . . . , N} and hi, i e A, are i.i.d. random variables on some probability 
space J 7 , F h ). 
The total magnetization, 

m N (a) = — E a »' ( 2 - 3 ) 

ieA 
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is an effective order parameter of the model, and the sets of configurations where 
the magnetization takes particular values play the role of metastable states. More 
specifically, we introduce the law of m N through 

Qp,N = fJ>p,N ° mj/-, (2.4) 

on the set of possible values T N = {-1,-1 + 2 /TV, . . . , 1}. Qp >N satisfies a large 
deviation property, in particular 

Zp, N Qp, N {m) = \/^pexp (-N(3Fp, N (m)) (1 + o(l)) , (2.5) 
with I N being the Legendre transform of 

^:^logcosh(t + /3^), (2.6) 

and with an explicit form for the rate function ("free energy"), Fp tN . The metastable 
states correspond to multiple local minima of Fp jN , whenever they exist. 

A crucial feature of the model is that we can introduce a family of mesoscopic 
variables in such a way that the dynamics on these mesoscopic variables is well 
approximated by a Markov process. Let us briefly describe these mesoscopic vari- 
ables. 

2.2. Coarse graining. Let / denote the support of the distribution of the random 
fields. Let I f , with i e {1, . . . , n}, be a partition of I such that, for some C < oo 
and for all £, < C/n = e. 

Each realization of the random field {hi} ieN induces a random partition of the 
set A = {1, . . . , AT} into subsets 

A k = {i e A : hi E I k }. (2.7) 

We may introduce n order parameters 



ieAfe 

We denote by m the n-dimensional vector (mi, . . . , m n ). m takes values in the set 

Tat = Xfc=i {-PAT,fe, -P^.fc + ^, • • • , PN,k - j?,PN,k} , (2.9) 

where 

Pk = pN,k = (2.10) 

Note that the random variables concentrate exponentially (in N) around their 
mean value E h p N>k = F h [hi G Ik] = Pk- The Hamiltonian can be written as 

n 

H N (a) = -NE(m{a)) + (2J1) 
where E : R n — > R is the function 

(71 \ ^ ^ 

J2 Xk ) +J2 hkXk > (2 - 12) 
fc=i y k=i 

with 

^ = y— | V" hi, and hi = h { - h t . (2.13) 

1 1 iGA £ 
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The equilibrium distribution of the variables m(a) is given by 

Qp,n{x) = P(3,N(m(a) = x) (2.14) 

- JLpPNE(x) ¥ « iP E?=iE ieA ,^(^-^) 
Zjn 

For a mesoscopic subset, A C T 1 ^, we define its microscopic counterpart, A, as 

A = S n [A] = {aeS N : m(a) G A} . (2.15) 

Note that, as in the one-dimensional case, we can express the right-hand side of 
(12351) as 



Zp, N Qp, N (x) = n J ^^ )/Pi) exp {-N0Fw(x)) (1 + o(l)) , (2.16) 



TVtt/2 

= 1 



with an explicit expression for the function Fi 



0,N, 



2 



Fp, N (x) = -- ( - ^ + 4 pdN,ii. x il Pi)- (2.17) 



The key point of the construction above is that it places the RFCW model in the 
context described in Section 1.2. Namely, defining the mesoscopic rates, r N (m } m/), 
in (I1.9D for the functions m defined in (I2.8D . one can easily verify that the estimates 
(11.10D hold, as was exploited in 0]] . In the next subsection we will show that the 
recurrence hypothesis (II. IIP also holds in this model. 
In [1]] we proved the following: 

Theorem 2.1. Assume that j3 and the distribution of the magnetic field are such that 
there exist more than one local minimum ofFp :N . Let m* be a local minimum of Fp ;N , 
M = M{m*) be the set of minima of Fp t jq such that F^{m) < Fp tN {m*), and z* be 
the minimax between m and M, i.e. the lower of the highest maxima separating m 
from M to the left respectively right. Then, F h -almost surely, 

^u s[rn , hsm rs[M] = C(f3, m\ M)Nexp (f3N [F PiN (z*) - F p>N (m*)]) (1 + o(l)) (2.18) 

where C(/3, m*, M) is a constant that is computed explicitly in [fl]]. 

Here the initial measure, vs[m*],s[M]> is the so-called last exit biased distribution 
on the set S[m*] = {a G S N : m N (a) = m*}, given by the formula 

VA,B\0) = ^ ( \-rm r y (2.19) 

Although the theorem is stated in [ 1 ] for the starting measure in a set defined 
with respect to the one-dimensional order parameter, the estimates given there 
immediately imply that the same formulas hold for Ej,„ ». s[m] t S [m], with m* a local 
minimum in the n-dimensional order parameter space. 

Theorem 12.11 implies that the estimate (I2.18D holds for ¥, a T S [M] for any a in a 
neighborhood of S[m*], for n large enough. 
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2.3. Local recurrence. Before starting the proof of (I1.12D . let us verify that the 
hypothesis (II. IIP holds for the RFCW model. Specifically, let us define the metastable 
set A s C T n as the ball, with respect to the Hamming distance, of fixed radius 5N, 
5 > 0, centered on a local minimum m* of Fp tN . Let A s C S N be the corresponding 
microscopic metastable set and denote by the first hitting time of the set S n [m] . 
With this notation we have: 

Lemma 2.2. There exist 5 > and c a > such that, for all n large enough, a, a' G A s , 

K [t b < r m{a/) ] < e" ClAr . (2.20) 

Proof. We first notice that if a' G S n [m*}, then the assertion of the lemma holds for 
all n sufficiently large with a constant c independent of n, as has been proven in 
ED (see Prop. 6.12). 
Moreover, for all a, a' G A s , 

K [r mi a') < r ffi .] > e~ c5N , (2.21) 

for some positive constant c. To see this, notice that, due to the property of A§, 
one can find a mesoscopic path from m(a) to m(cr') with length at most 5N. Im- 
plementing the argument that is used in the proof of Lemma 6.11 of HTfl, one gets 
(I2T2TD . 

To prove (12.20D . we use a renewal argument. Let us consider a configuration 

a G S n [m*} and a generic a' G A$, and set m = m(cr'). Then 

P<t(tb < Trn) < P ct (7b < T„ A T„») + ^ P CT (W < Tb < Trn , (j(Trn*) = 7]) 

7]£S n \m*) 

< P (J (r B < T m ») + max P„(r B < r m )P (T (r m » < r m ) 

— »?G<S"[rn*] _ _ _ 

< e~ coN + max P„(r B < (l - e"^) , (2.22) 

r)dS n \m*} 

where in the second line we used the Markov property, and in the last line we used 
the inequality (12.20D and (12.211) . Taking the maximum over a G S n [m*} on both 
sides of (I2.22D and rearranging the summation, we get the inequality (12.201) for 
o G S n [m*}, with a constant c\ = c — cd which is striclty positive for small enough 
5. 

Now let us consider the general case when a, a' G Ag and set again m = m(cr'). 
As before, we have 

f (T (T B <T m ) < P CT (r B < A w) + ^ P CT (t„* < T B < T„ , (t(t„. ) = 7]) 

7]GS n \m*] 

< P (7 (r B < T m *) + m a x P^TB < Trn}^a{rm* < t b) 

r]£S n \rn*] 

< e~ CoN + e~ ClN = e- ClN (l + o(l)), (2.23) 

where in the third line we used the fact that the inequality (I2.20D was already 
established for rj G S n [m*]. This concludes the proof of the Lemma. □ 

Lemma [2721 shows that, for all n large enough, the RFCW model parametrized by 
the variables m G T 7 ^ satisfies the hypothesis (II. IIP with A = A s as defined above. 
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3. Construction of the coupling 

3.1. The coupling by Levin, Luczak, and Peres. Recall that we consider a parti- 
tion, {Ai, . . . , A n }, of A = {1, ... , N} and let m = (mi(cr), . . . , m n (a)) be the vector 
of partial magnetizations as defined in (12.8D . 

We begin by explaining a coupling that was used by Levin et al. Q8J in the usual 
Curie-Weiss model. In that case the transition rates have the following properties: 
Whenever x, y and a, r] are such that 

(i) m(a) = m(j]), and 

(ii) cr x = T] y , 

then 

p N (a,a^)=p N ( V , V ^). (3.1) 
We continue to employ the notation p x (cr), 

p-*«((T) = Np N ((T,(Tto) and p+(a)+p-(a) = l (3.2) 

where, as usual, <jW is the configuration obtained from a by setting a% = —a x and 
leaving all other components of a unchanged. 

The coupling of Levin et al. is constructed as follows. Let a and rj be two 
initial conditions such that m(a) = m(rj). Let I t , t — 0, 1, 2, . . ., be a family of 
independent random variables that are uniformly distributed on A. Assume that at 
time t, rn(a(t)) = m(r}(t)) and do the following: 

(01) Draw the random variable /(; 

(02) Set rji t (t + 1) = ±1 with probabilities pf {r)(t)), respectively, and set rj x (t + 
1) = rj x (t) for alls ^ I t \ 

(A) Then do the following: 

(i) lia It (t) = r] It (t), then set 

* a It (t + l)=r ]li (t + l); 

* a x {t + 1) = a x {t), for all x ^ I t . 

(ii) If <Ji t (t) 7^ T]i t (t), then let A k be the element of the partition such that 
I t G A fc and choose y uniformly at random on the set {z e A fc : a z (t) ^ 
r) z (t) 7^ Vi t (t)}- Note that this set is not empty, since m(a(t)) = m(r}(t)) 
and a(t) and i](t) differ in one site of A k . Then set 

* a v (t + l) = Vlt (t + l); 

* ajt + 1) = a x (t), for all x^y. 

Note that this construction has the virtue that m(a(t+l)) = m(r](t+l)), so that the 
assumption inherent in the construction is always verified, if it is verified at time 
zero. 

Moreover, if a(t) = r](t) for some t, then a(t + s) = r](t + s), for all s > 0. Finally, 
one easily checks that the marginal distributions of a(t) and rj{t) coincide and are 
given by the law of the original dynamics. This latter fact depends crucially on the 
fact that the flip rates do not depend on which site in a given subset A 4 the spin is 
flipped, provided they are flipped in the same direction. 

3.2. Coupling attempt in the general case. In the general case we consider here, 
including the RFCW, (13. ID does not hold unless x = y. All we assume is (A.2) and 
(11.10D . The problem is now that the probabilities to update the a-chain in a chosen 
point y are typically not the same as those of the 77-chain in the original point I t . 
However, by (11.101) . these probabilities are still close to each other, in the sense 



EXPONENTIAL LAWS VIA COUPLING 



11 



that there exists v = u(n) with v \. as n | oo, e.g. u(n) = 3e(n), such that for any 
k, for any x,y e A k and for any a and 77 with m(cr) = 777.(77) and = 77^, 

< 1 + ^ (3-3) 



Thus, in order to maintain the correct marginal distribution for the processes, we 
have to change the updating rules in such a way that the cr-chain will sometimes not 
maintain the same magnetization as the 77-chain, which implies that the coupling 
cannot be continued. 

The basic strategy to overcome this difficulty is to use iterated coupling attempts. 
We shall decompose the cr-path on [0, rg) into cycles and during each cycle we 
shall attempt to couple it with an independent copy of the 77-chain. In the case of 
success, both chains will run together until r B - Such procedure necessarily involves 
a sampling of 77-paths. In order to control its bias, it will be important to separate 
the path properties of 77-chains with which we try to couple from the probability of 
whether a subsequent coupling attempt is successful or not. This will be achieved 
by constructing a coupling on an extended probability space. 

Basic coupling attempt. There are two parameters c 2 > and k < 00 whose 
values will be quantified in the sequel. 

Let 77 and a satisfy 777(77) = 777(a). We shall try to couple a cx-path with an 77-path 
during the first A K -steps of their life. Let M = c 2 N and let Vi,i = 1, . . . , M, be a 
family of i.i.d. Bernoulli random variables with 

P [Vi = 1] = 1 - P [Vi = 0] = 1 - v(n) (3.4) 

We now describe how the coupling construction is adjusted using the random vari- 
ables V u i = 1, . . . , M. Let 777,(77(0)) = m(<r(0)). 

As before, let I t , t = 0, 1, 2, . . . , N K , be a family of independent random variables 
that are uniformly distributed on A. Let Mq = and xo = 0, 77(0) = 77, er(0) = a. 
At time t > 1, do the following: 

(01) Draw the random variable I t ; 

(02) Set rji t (t + 1) = ±1 with probability p f t (rj It (t)) and set rj x (t + 1) = rj x (t) for 
all x ^ I t ; 

(A) If at time 1 < t < N K , xt = and M t < M, then do the following: 

(i) If a It (t) =rj It {t), then set 

* a It {t + 1) = Vlt (t + 1); 

* a x (t + 1) = a x (t), for all x ^ I t ; 

* Mt+i = M t . 

(ii) If oi t (t) ^ r]i t (t), let A fc be the element of the partition such that I t G A k 
and, as before, choose y uniformly at random on the set {z e A fc : 

Vz(t)^Vz(t)^Vi t (t)}- Then set 

* a y (t + 1) = r)i t (t + 1) with probability 

fl ifV Mt = l 

{ pf t (r;(t))A P ± (g(t))-(l-u)p% (r,(t)) (3.5) 
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r)i t (t + 1) with probability 

ifV Mt = l 
if W = 



ifV Mt = l 

p%(v(t))-pf t (v(t))A P tnt)) (3.6) 



* a x {t + 1) = a x (t), for all x ^ y. 

* If Vm± — 0, then set Xs = 1 for s = i + 1, . . . , N K , otherwise set 
Xt+i = Xt- 

* SetMt+i = M t + l; 

(B) If at time t, either xt — 1 or M-t — M , then update a independently of rj, 
i.e. 

(i) draw I[ independently with the same law as I t , and 

(ii) set + = ±1 with probability pf, (a (£)), and a x (t+l) = a x {t), for 
all x ±I[. 

The process M t is a counter that increases by one each time a new coin is used 
by the coupling. The value xt = 1 of the variable xt indicates that a zero coin was 
used by time t. 

The following lemma collects the basic properties of the process constructed 
above. 

Lemma 3.1. Let P denote the joint distribution of the processes a, rj, V defined above. 
Then the above is a good coupling in the sense that the marginal distributions of both 

r)(t),t < N K and a(t),t < N K under the law P are P CT ( ) and P^o), respectively. 

Proof. The assertion is obvious for the process r](t). It is also clear for the a(t) 
process if updates are done according to case B. Therefore, we only need to check 
that it holds for process a(t) at such times t < N K when xt is still and Mt is still 
less than M = c 2 N. In other words, we have to compute 

P [ a (t + 1) = a+{t)\a{t); X t = 0;M t < c 2 N] , (3.7) 

where a+ = (ax, . . . , a x -i, +1, . . . cr N ). First it is clear that, given that I t = x and 
ai t (t) = rji t (t), we get the desired result, i.e. 

P [a(t + 1) = a+(t)\I t = x- a x (t) = Vx (t); a(t); X t = 0;M t < c 2 N) 
= Pt(ri(t))=PtWt)). (3.8) 
In the case I t = y ^ x we get a contribution to (13.71) only if 

(i) x, y are in the same set A,, 

(ii) a y (t) ^ r) y (t), 

(iii) a x {t) ^ r] x (t), and 

(iv) a x (t) = r] y (t). 

If these conditions are satisfied the probability to flip o x to +1 is 

(i-u) P +( v (t)) + vpZW)- u P +( v (t)) ( 93 

_ Py(v(t))-p-(v(t))Ap-(<x(t)) 
= Pt(v(t))Ap+(a(t))+p-( V (t)) -p-( V (t))Ap-(a(t)) 

= Pint)). 
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The last line is easily verified by distinguishing cases. It follows that the probability 
in (13.7D is equal to A^ _1 p+(<j(t)), as desired. This proves the lemma. □ 

The construction above tries to merge the processes 77 (i) and a(t) only as long 
as xt = and M t < M. Note that if for some t < N K both these conditions 
still hold and, in addition, r](t) = a if), then the two dynamics automatically stay 
together until N K and, indeed, stay coupled forever. Naively, one would want to 
classify such situation as "successful coupling". However, this would involve an 
implicit sampling of ^-trajectories which may lead to distortion of their statistical 
properties. For example, it is not clear whether the correct value of E,,t b would 
survive such a procedure. 

In order to circumvent this obstacle, we use a more restrictive definition of what 
a "successful coupling" should be. Namely, we say that our basic coupling attempt 
is successful if the following two independent events A and B simultaneously hap- 
pen on the enlarged probability space: 

(1) The event 

A ={^1^ = 1}, (3.10) 

is the event that all M random variables Vi should be equal to 1 . 

(2) The event B depends only on the random variables 77 (t), t < N K . To define 
it, we introduce two stopping times, S and J\f. Let 

S x = inf {t : Vx (t + 1) = -tfc(O)} (3.11) 

and set S = maxi< x <x S x . Clearly, S x is the first time the spin at site x has 
been flipped and S is the first time all coordinates of r] have been flipped. 
J\f is defined as 

S N 

Af^Y.Y, 1 ^} 1 ^}- (3.12) 

t=0 x =l 

which is the total number of flipping attempts until time S. Finally, 

B = {t v b > N K } n {S < N K } n {AT < M} . (3.13) 

The important observation is the following. 

Lemma 3.2. On A n B, the coupling is successful in the sense that 

AnB C {r](N K ) = a(N K )}. (3.14) 

Proof. On the event B fi A, by time N k rj(t) has not reached B, all spins have been 
flipped once, and each flip that involved a site where 77 (i) 7^ a it) was done when 
the coin Vi took the value +1. Therefore on each first flip the corresponding r/ and 
a spins became aligned, hence r](N K ) = a(N K ). □ 

Remark. Note that the inclusion (13.141) is in general strict. The rationale for the 
introduction of the events A and B is that the unlikely event A does not affect 
the r/-chain at all and that the (likely) event B does not distort the hitting times of 
the ?7-chain in the sense that (r B l B ) > E^r^ (l — e~ ciV ). This will be part of the 
content of Lemma [3T3l which we formulate and prove below. 
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3.3. Construction of a cycle and cycle decomposition of a -paths. We have seen 
that Bf)A indicates that our coupling is successful and rj (t) and a it) arrive together 
in B. However, the probability of AnB is very small, essentially due to the fact that 
the probability of A is small, namely F(A) = (1 — v) M . What will be essential is 
that the probability of B is otherwise close to one, and therefore the 77-paths (which 
are independent of the Vi) will be affected very little by the occurrence of A fi B. 

We then have to decide what to do on (A fi B) c at time N K . Define the stopping 
time 

A = min{t> N K : a(t) G S n \m(r)))} . (3.15) 

If 

V = {A < t° b } (3.16) 

happens, then we initiate a new basic coupling attempt at time A for a new, in- 
dependent copy of the r^-chain and a chain starting from cr(A). Otherwise, on the 
event V c n (A n B) c , the process stops and coupling has not occurred. 

The cycle decomposition of cr[0,rg) is based on a collection jf/[0, t^' 7 ) j of in- 
dependent copies of 77-chains and on a collection {V_ e = (Vq , . . . , V^ K ) } of i.i.d. 
stacks of coins. The events {A e , B e } are well defined and independent. The events 
{T> e } are defined iteratively as follows: The event V° is simply the above event 
V defined with respect the coupling attempt based on {rj°,V_ }. If V° occurs we 
denote by 6 = A the random time at which the first cycle ends. Assume now 
that rio _1 P £ n n(A e n B e ) c happened and that the (k - l)-st cycle was finished at 
a random time 6 k -i and at some random point cr(^-i) € S n [m(r])]. Let us initiate 
a new basic coupling attempt using a new independent copy {r] k ,V_ k } for a chain 
starting at rj and a chain starting from a{6 k -i). The event V k , and accordingly the 
cycle length A k , are then defined appropriately. If V k happens and (~)(A e R B e ) c 
then 6 k = 6 k _i + A k , a(6 k ) e S n [m(rj)] is well defined as well, and the iterative 
procedure goes on. 

In the light of the above definitions, the enlarged probability space Q has the 
following disjoint decomposition, 

00 fc— 1 

1 = ^2l Ak l B k JJ(1 - l^laO l-pt (3.17) 



fc=0 £=0 



fc-1 



+ 5^(1 -i^i^ci- i^nc 1 - 

fc=o e=o 
As a consequence, we arrive at the following decomposition of the hitting time rg 
in terms of the (independent ) hitting times jr^ j, 

t b = J2 \ II ^0- - Iaw W [ (Vi + t-b") 1x*1b* 

fc=0 I f=0 J ro i 

00 (fc-1 } (3 - 18) 

fc=0 u=o J 

(In both formulas above we use the convention that products with a negative num- 
ber of terms are equal to 1 and set 6-1 = 0). Note that the first terms in (13.171) and 
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(13.181) correspond to the cases when the iterative coupling eventually succeeds, 
whereas the second term corresponds to the case when it eventually fails. 

3.4. Upper bounds on probabilities and proof ofTheorem ll.il 

Lemma 3.3. The following estimates hold uniformly in a,rj e A: 

(i) There is a constant c > 0, independent ofn, such that, for N large enough, 

F(B c )<e~ cN , (3.19) 

and 

(t b 1b) > E„r B (1 - e~ cN ) . (3.20) 

(ii) If N is large enough, 

P CT (27) > 1 - e' cN . (3.21) 

Proof Item (ii) follows from Lemma [2721 with, e.g., c = c\/2. To prove item (i), we 
write B c = {r B < N K } U {S > N K } U {Af > M}. Thus, 

1/3 C = l{r^<7V-} + l{ r ]>>jv«}l{S>jV«} + l{rg>AT«}l{5<W«}l{AA>M}- (3.22) 

Inserting this into Equations Q3.19D and (I3.20D . there are three terms to bound. 
The first term is easy: 

P„ (t b < N K ) < N K max F a > (r B < r„) < N K e - ClN , (3.23) 

a' :m(cr')=m 

The first inequality used the fact that in order to reach B, the process has to make 
one final excursion to B without return to the starting set m, and that there are 
at most N K attempts to do so. The last inequality uses (12.20D . The corresponding 
term for (I3720D is 

E„ (r B l {r , <iVK} ) < N 2K e- c > N . (3.24) 
The second term is also easy: First, 

Pr, ({r B > N K } C\{S> N K }) <F V (S> N K ) (3.25) 

and 

E„ (t b 1{t b >n«}1{S>n«}) < J2( NK + E -' rfi ) F v (VN* =ct';S> N k ) 

cr' 

< (N K + maxE f7 ,T B ^ v (S > N K ) . (3.26) 

Using the formula (11.41) with A = {cr'}, and bounding the corresponding capacity 
cap(a', B) > e~ C3N from below in the crudest way (e.g. retaining a single one- 
dimensional path from a' to B, see [2]), one gets that 

N K + maxE CT ,TB < e 2csN , (3.27) 

cr' 

where c 3 does not depend on n. 

Next we show that if k > 2 the probability ¥ V (S > N K ) is super-exponentially 
small. Indeed, since at each step the probability to flip each particular spin is 
bounded from below by (1 — a) /N, 

/ 1 — \ NK 

F v {S>N K )<Nh- -j^ J < e - C4jVK_1 . (3.28) 
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Finally, even the third term is easy: 

E„ (l{T B >N"}Ms<N«}t{X>M}) < P„ (AT > M) , (3.29) 

and, as in (13T261) . 

E v (t b 1 {tb>Nk} 1 {s<Nk} 1 w>m} ) < (n* + maxE CT , r s ) P„ (JV > M) 

< e 2c3Ar P„ (A/" > M) . (3.30) 

It remains to bound F v (A/" > M). In order to do this we split the time interval [0, S] 
into epochs 

[0, S] = [0, S h ] U (S^ S^] U ■ ■ ■ U (3.31) 

where i = {i 1} ...,i N }isa permutation of {1, ... , N} which is fixed by the order in 
which spins are flipped for the first time, 

S h <S i2 <---< S lN = S. (3.32) 

Fix a particular permutation i and let £[z\ be the event that (13.321) happenes. Let 
us first derive a lower bound on P^ (£ [i] ) . It is convenient to decompose S [z] = 

r\k=o £ k\i], where 

£o[i] = {No spin was flipped on [0, S h — 1)} fi {Spin i 1 was flipped on S n -t step} 

(3.33) 

and 

£fc[i] = {No spin was flipped for the first time during [S ik , S ik+1 — 1)} 

R {Spin Zfc+i was flipped on S ik+1 -t step} . (3.34) 

Let Afk be the number of times previously unflipped spins were attempted to flip 
during the interval (S ik , S ik+1 \. Clearly TV = J2k=o^k- 
In view of (fTTTl) . 

a e ° 

P^(^o[i];A^o = 4)<-^. (3.35) 

In order to give an upper bound on the probability of the events {£&[£]; A4 = £k}, 
for k > 0, we distinguish between two types of trials, which happen during the 
intervals (S ik , S ik+1 ). First, one might choose yet unflipped spins from {ik+i, ■ ■ ■ , in} 
but then fail to flip them. On the event {A4 = £ k } this happens exactly t k — 1 
times. Second, one might choose already flipped spins from the set . . . ,i k }. 
The probability of the latter is k/N, whereas, according to (11.71) , a uniform upper 
bound for the probability of the former option is a(N — k)/N. Thus, if Q k is the 
a-field generated by rj[ ,s k ], then 

F,M. = 4|*) < ^(^)'*- 1 (g(i) J )" (3,6, 



a 



N-k 

Therefore, 



fc-i 



N — k 

v^=0 / £=0 
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and, consequently, 

As a result we get that 

P„ W >M)< J2 aL ( N t L )-^ e- ln ^ L e N ^ 1+L ^ +1 l (3.39) 

L>M ^ ' L>JU 

For M = c 2 N and providing that c 2 is large enough, we finally obtain that 

P„ (jV > M) < e" C5jv , (3.40) 

for a constant, c 5 , increasing linearly with c 2 . Putting all estimates together con- 
cludes the proof of the lemma. □ 

Notice that if A D B happens, then m{rj t ) = m(a t ). In particular, A R B C 
{rg > iV K } and hence r£ = r n B on .A n B. 

Let us go back to the cycle decomposition (13.181) . Using E for the expectation on 
the enlarged probability space, 

oo (k-l \ 

Er ^E^ II ^ £ ^ - f Tb^a^b* (3.41) 

Let J z e k be the a-algebra generated by all the events and trajectories A e , B £ , V e , 
rf and a(9g-i, 0(\, t < k. In view of the independence of the copies {rj e ,V_ e }, 

E (r^l^l^l^.J = P(^)E„ (t b 1 b ) = (1 - u) M E v (t b 1 b ) (3.42) 
On the other hand, 

E (l ve (1 - l A il B i) T 6 . \ > E(l - l A il B t) - max (1 - P CT >(£>)) 

\ J a'eS n \m] 

> 1 - (1 - v) M - e~ cN . (3.43) 

Altogether (recall that M = c 2 N), 

oo 

Kr B > E,(r B l e )(l-z/r Ar ^(l-(l-z/r Ar -e- c7V ) 



[i - [L-u)— -e 

fc=0 



1 - e~ cN 

which tends to E^tb if u < c/c 2 . This concludes the proof of Theorem ll.il 



3.5. Extension to the case when m(a) ^ 777.(77) • V erv little has to be changed 
if we replace the condition that we start in a configuration a that has the same 
mesoscopic magnetization as r\, but for which (11.111) still holds. In that case, we 
cannot start the coupling in the first cycle, so we simply have to wait until time A 
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(provided V° occurs, i.e. a t does not hit B before that time). This means that we 
replace (137T8D by 



fc-i 



t b = J2\ 1 » (0) II 1 ©w t 1 - \ (Ok-i + rj 77 ) 1^1 B * + rg(l - 1^(0)) 

fc=l { 1=1 ) 

oo ( "| 



fc=l V £=1 



(3.45) 

We then proceed exactly as before to get 

oo 

Kr B > E v (r B l B )(l-ur N Y,^-e- C2N ){l-(l-iy) ClN - e~ c * N ) k 

k=0 

(1 - e"^) (1 - e-"") 
£ ^ \ + {1 -il- n » e -J (3.46) 

which is virtually equivalent to the previous case. 

3.6. The Laplace transform. Next we show that the same coupling can also be 
used to show that the Laplace transform of t b depends very little on the initial 
conditions within a set A. Set T = E ua t b . 

Proposition 3.4. If A,B satisfy the hypothesis of Theorem 11.11 then, for every con- 
figurations a,rj G A and A > 0, 

R a (\) = E CT (e-* 7 *) = E„ (e"* 7 *) (1 + o(l)) (3.47) 

The proof of Proposition [331 involves some estimates and computations that we 
collect in the following lemmas. 

Lemma 3.5. There exists a constant, c > 0, independent of n, such that, for any 

rjeA, 



E„ (l B e-^ TB J > E„ (e-^ TB J (1 - e" cAf ) (3.48) 

Proof. The proof is similar to that of (I3.20D and uses some of the estimates given 
there. The aim is to prove that 

E„ (l ec e-? rs ) < E,, (V? TS ) e~ cN . (3.49) 

By Jensen's inequality, for every rj e A, 

E v (V^ rs ) > e ~^ TB = e~ x (l + o(l)), (3.50) 

where the second line follows form the pointwise estimate on ~E, n r B that was proven 
in the previous subsections. To prove Q3.49D , it is enough to notice that, by Lemma 
(1331) . 



E v ( l B ce-T TB ) < e~ CI \ (3.51) 

□ 
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Proof of Prop. 13.41 For simplicity consider the case when m(a) = m(r}) = m. Anal- 
ogously to Eq. (I3.18D , we obtain 

/ oo fc-1 \ 

E.(e-7 Tfl ) = Ef^e-^-^'h^l^JI 1^(1-1^)1^)) (3 - 52) 
\fc=o £=o / 

too fc— 1 

^ (i _ lvk ) (i _ i Ak t Bk ) n i PW (i - 1^, i s ,) 
fc=0 £=0 
fc-1 



fc=o V t=o 

oo / fc— 1 



fc=0 V £=0 

Now, for every k,£>0, as in (f3~42D . 

I {l Ak t Bk e~^ T B n \F ek _^ < (1 - z/) M E,, (V^ rB ) . (3.53) 
Moreover, as in (|3~43l) . 



E^l^)(l-l^)l B ,)|jr^ < E^l-l^oit 

= 1-P(U)P(£) 

< l-(l-u) M (l-e- cN ). (3.54) 

This last estimate, together with Equation (13.21D of Lemma 13.31 shows that the 
last line in (13.521) is smaller than 

oo 

J2e~ cN (1 - (1 - v) M (l - e~ cN )) k < 2e~ N ^' c ^ (3.55) 

fc=0 

Combining these estimates, we arrive at 

oo 

E a (e-7 T A < E r ,(e-^)(l-z/) C27V ^(l-(l-z/) C2jv (l-e- cJV )) fc 

fc=0 

< E„ (e-^ T A (1 - e~ cN ) + 2e~ Ni - c ~ c ^ 

= Er, (V^ rs ) (1 + 3e-^ c - C2 ^) (3.56) 
which tends to E,, ^e"^ rs J if v < c/c 2 . □ 

4. Renewal and the exponential distribution for the RFCW 

We will use the results of Section [2] and the notation introduced therein. In 
particular, for each n fixed, we set A = S n [m*} and As = S n [As], where As is the 
mesoscopic ^-neighborhood of m*. In the sequel we choose n appropriately large 
and 5 appropriately small. 

In the case of the RFCW model we prove the convergence of the law of the 
normalized metastable time, r B , to an exponential distribution, via convergence of 
the Laplace transform, R a (X), defined in Q3.47D . The proof of the latter is based on 
renewal arguments. 
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4.1. Renewal equations. By Proposition |3.4[ instead of studying the process start- 
ing in a given point, a, for which no exact renewal equation will hold, it is enough 
to study the process starting on a suitable measure on A, for which such a relation 
will be shown to hold. For A > 0, let p A denote the probability measure on A that 
satisfies the equation, 

5>a(")E„ {e-^l TA<TB l a{TA)=a ^ = C(X)p x (a'), (4.1) 
for all a' e A, where 

C(X)=E Px (e-^l TA<TB y (4.2) 

Existence and uniqueness of such a measure follows from the Perron-Frobenius 
theorem in a standard way. 

The usefulness of this definition comes from the fact that the Laplace transform 
of t b started in this measure satisfies an exact renewal equation. 

Lemma 4.1. Let R Px (X) = p\(<j )R a (X). Then 

E Px (e-^l Tfl<TA ) 

Rpx W = ~\ (4 - 3) 

l-E Px (e-T^l rA<TB j 
Proof. Using that 1 = t TB<TA + ^t A <t b and the strong Markov property, we see that 

R Px (X) = E p Je-^l TB<TA ) (4.4) 



+ £ E px (e-^l TA<TB l <TA)=a >) E a ,e-^ B 
= E px (e-^ TB l TB<TA ^ + J2 C{\) Px {a')E a ,e-^ B 

a'eA 

Equation A4.3D is now immediate. □ 

4.2. Convergence. As a result of the representation (14.3D , Theorem ll.2l will follow 
from (I3.47D once we prove the following lemma. 

Lemma 4.2. With the notation from Lemma WA] for any \>0, 

E px (e~^l TB<TA ) 1 
lim ^— '-^ = — — . (4.5) 

^l-E PA (e-^l r , <Tfl ) 1 + A 

Proof. The proof of this lemma comprises seven steps. 
STEP 1. Define T A = E px . We claim: 

Lemma 4.3. There exists c 8 > 0, such that, for any X > fixed, 

r ^ p E (r T <r) < 1 + ° (e "" A ' ) )- (4 ' 6) 
^ Px {tb < r A ) 
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Indeed, 

T A = E Px (t b 1 {tb<ta} )+E Px (t b 1 {ta<Tb} ) (4.7) 
= E PA r AuB + E PA (l {TA<rs }E CT ( rA )T B ) 
= E px r AuB + T x F px (r A <r B ) 

+ E PA (l{r A <- S } ( E -(rA) r B - T A)) • 

However, by the invariance of p\, 

E PA (l {TA<TB} e-^ TA (E a{TA) r B — Txj^j = 0. (4.8) 
It follows that the absolute value of the last term in (14. 7D is bounded above as, 

E PA ( (l - e-^ T A 1 {ta<tb} ) max \E a r B - T x \ (4.9) 



< A max 

a£ A 



E a r B - T x 



o-eA 

E PA {TAt {TA < TB} ) = o(e~ CN ^)E px (t a 1 {ta<Tb} ) 



T 

where we used (11.12D in the last step. This implies the claim of the lemma. □ 
STEP 2. Control of p x -measure. 

Lemma 4.4. There exists c 9 < oo, such that for any n (and hence e = e(n)) fixed, 
as soon as N is large enough. 

Proof In order to prove (14. 101) first of all note that by reversibility, 

/V)IV if A < t b; a(r r A ) = a)= ^(a)¥ a (r r A < r B ) , (4.11) 

a'eA 

where t a is the r-th hitting time of A. Assume now that we are able to prove that 
there exists r and M such that 

P, (r A < r B ; a(r r A ) = a) < (1 - e)- M F a , (r r A < r B ; a(r r A ) = a) , (4.12) 

uniformly in r/, a, a' e A. In view of fl4.ll) this would imply, 

pxip) < ^7 ^PxivWv ( t a < T B ) °{t a ) = a) 

(4.13) 

< 1 ~^ )r Vo (r A <T B ;a(r r A ) = a). 

Multiplying both sides above by p(a') and applying Q4. IIP , we conclude that A4.12D 
implies that 

uniformly in a e A. The target (14.1 OP , therefore, will be a consequence of the 
following two claims: There exists c > 0, such that, independently of the coarse 
graining parameter n, 

C{\) > l-e" cAr , (4.15) 
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A 
T 



TA < T B 

T A <r B )> (4.16) 



as soon as iV is sufficiently large. Furthermore, for sufficiently large c 2 and k, 
(l4~T2l) holds with M = c 2 N and r = iV K . 

We first show that 04.151) holds. By the uniform bound (I2.21D and Jensen's 
inequality, it follows 

C(A)> ((l-e- cJV )^p A (a)E ff ( e H 

^p A (o-)E (J frA 

AE PA (rAl{ TA<rB }) 
T (1 — e~ cJV ) 

F U n W T a P pa ( r £ < r A) (A - 7 s 

E px [t a 1 {ta<Tb} ) < 1 + Q ^_ CsN ^ , (4.17) 

and (I4J5D follows by (IZ2TT) and (fL12l) . 
Next we show that of (I4.12D holds. There exists c w < oo such that 

P„ (r r A < r B ; a Tl = a) > e~ cwN ¥ v (r r A < r B ) , (4.18) 

uniformly in a, r/ e A. This is a rough estimate: by the Markov property, 

P„ (r r A < t b ; a T r = a) > P„ (r£ < t b ) minP v (r A < r B ; a TA = a) . (4.19) 

Jj'SA 

Let r/' e A and let the Hamming distance between a and r/' be K. Then we can 
reach a from r/ by flipping exactly K spins; since this can be done in K\ orders, 
and each flip has probability at least ((1 — a) /N) by Q1.7D , we see that 

P,' (r A < r B ; a TA = a) > -^(1 - a) K , (4.20) 

and (I47T8D follows. 

Next, let i] e A and consider a dynamics starting from rj. We shall try to couple it 
with a dynamics starting from a' using just one basic coupling attempt. Employing 
the same notation as in Subsection 13.21 we know (see (I3.28D and A3.40D ) that for 

k > 2 and M = c 2 N, 

P„ (S > N K ,Af > M) < e~ C(iN , (4.21) 

where c 6 grows linearly with c 2 . In the sequel we choose c 2 so large that c 6 becomes 
larger than the constant c w in A4.18D . 

Let us redefine the event B in (f3~7L3l as £ = {£ < iV K } n {A/" < A/}. The coins 
Vi, . . . , Va/ and the event A = {V^V* = 1} remain the same. Consider the en- 
larged probability space (fl, Pj which corresponds to a single basic coupling at- 
tempt to couple a dynamics a(t) from a' to the dynamics rj(t) which starts at rj. 

The coupling is successful if and only if the event A fi B, which depends on at 
most N K steps, happens. Therefore, 

(t a < r B ; a{r r A ) = a) > P (N K < r r A < r B ; V (r r A ) = a; A; B) (4.22) 

= ¥ 11 (N K <T r A <T B ;r ] (T r A ) = a;B)¥(A) 
= F v (N«<T r A <T B]V (r r A ) = a;B)(l-e) M . 
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Now, let us choose r = N K . In particular, the constraint N K < t t a becomes redun- 
dant. By (I2.21D and in view of 04.18D and our choice of M which leads to a large 
c 6 in Q4.21D , there exists en > such that, 

¥ v (T A <T B ; V (r r A ) = a;B) > P„ (t a < r B ; V (r r A ) = a) - F V (B C ) (4.23) 



> F ri (r A <r B ; V (r r A )=a){l-e'^ N ). 



(I4TT2D follows. 



□ 



STEP 3. The following crucial bound, to which we refer to a uphill lemma, will be 
proven in the next subsection: 

Lemma 4.5. There exists c 7 > such that 

E Px (r B t TB<TA ) < e" C7jv E p , (taWb) . (4.24) 
Remark. Intuitively, the bound Q4.24D should follow from the decomposition 

Ktaub = K (r A < r B ) E a (t a \t a < r B ) + P CT (r fl < r A ) E CT (r B \r B < r A ) (4.25) 

since the first probability on the right hand side is close to one, the second is 
exponentially small, and the two conditional expectations should be of the same 
order. It seems, however, remarkably difficult to establish such a result uniformly 
in the starting point a e A, for the same reasons why the pointwise control of 
mean exit times is difficult. 

We shall proceed with the proof assuming that C4.24D holds. 
STEP 4. In view of (14.6D , a look at (14.241) reveals that the conditional expectation, 



E PA [t b I r B < t a ] 



T 



c 7 N\ 



o(e- C77V ). 



(4.26) 



Using that, for x > 0, 1 > e x > 1 — x, it follows that the numerator in (14.31) 
satisfies 

E px (e-^l TB<TA ) = F px {t b < t a ) (1 + o(e- C77V )) . (4.27) 



STEP 5. Let us turn now to the denominator in (14.31) . We rewrite it as 

/ 



Pp. (tb < ta) 



E 



PA 



1 + A- 



1 — e t 



AP„ (t b < t a ) 



V 



/ 



Using 04.61) for 1/P PA (t b < ta), we are left with the computation of 

T 

1 {ta<t B } 



AE Pa (t a 1 {ta<Tb} ) 



E 



Since, 



E 



PA 



1 -e"T 1 



1 



{tA<tb} 



Pa 



A 



1-e'T 



A 



sX 



- I E px [e-T^r A l {TA<TB} )ds 



we deduce that the expression in Q4.29D belongs to the interval 

E Px e^ TA T A l {TA<TB } x 

E px T A l{r A <r B } 



(4.28) 



(4.29) 



(4.30) 



(4.31) 
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The target A4.5D follows once we show that 

lim = 1. (4.32) 

It is clear that A4.32D follows as soon as we check that there exists a sequence 
a at l 0, such that 

lim E PA T A 1 {r A <r B } 1 {r A <a N T} _ 1 - 4 
Af^oo E pA TAl{ TA <r B } 

This will be our next goal. 

Let 5,5 = Sn \ As. Our proof of (14.331) is based on the following decomposition, 

E px T A l{ TA <T B }Mr A >a N T} < E px T A ^{ TA <r B J Mt a >cc n T} + E Px t aI{ TBs<ta<tb } 

= Is + lis- 

The logic behind this decomposition should be transparent: The conditional (on 
t~a < t~b) landscape should have the global mesoscopic minima at A. The term I^ is 
a local one and should be small, since the dynamics cannot spend too much time 
inside a local well As without hitting A. On the other hand, the term 11^ should be 
small because of the price paid for the uphill run towards B s before hitting A. We 
claim that there exists a N I and c > such that 

max {I s , Us} < e~ cN E Px t a 1 {ta<tb} . (4.35) 

Evidently, (I4.33D is a consequence of (14.351) . 

STEP 6. Bound on Lj. The term 1,5 is bounded above as 

l s < maxE a T AuBs I{ TAuBs>aNT y (4.36) 

The right hand side of 04.361) depends on the dynamics in a ^-neighbourhood of a 
non-degenerate local minimum A = S n [m*]. We try to formalize an intuitive idea 
that such dynamics mixes up on time scale much shorter than T and cannot afford 
spending a N T units of time without hitting AU B s . This is a somewhat coarse 
estimate. Let us start with estimating hitting times from equilibrium measure over 
mesoscopic slots: 

Lemma 4.6. Let As and Bs be as defined above. Then there exists c(8), satisfying 
c(8) | 0, as 5 | 0, such that, for all mf e A$\ m*, 

E^taubs < e c(5)Ar , (4.37) 

where v m > is the probability measure on S n [rn/], which we referred to in Eq. ftl.4h 

Proof. By formula (I1.4D , we have that 

K m ,r AuBs = —7— , AuB) Yl ^M a ) h s n \m'],s[AuB s ]{o-) (4.38) 

Py — ' S) a&A s \A 

< 1 / \ _ W,n(A\A s ) 

- cap(m',A) a ^/ P ' N{(X> cap(m',A) 

Note that we used here only the crudest possible estimate on the harmonic func- 
tion hsnirn>],AuB s (c), but the results of [1] do not give us anything much better. It 
remains to bound the capacity cap(m', A) from below. However, this is relatively 
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easy using the methods explained in Section 5 of [1], to which we refer for further 
details. One gets that 

cap(w', A) > e~ cSeN fi^ N (mf) . (4.39) 

□ 

As a consequence we obtain: 

Lemma 4.7. Let A s and B$ be as defined above. Then there exists c(5) satisfying 
c(8) I as 5 I 0, such that, for all i] e A S \A, 



c(S)N\ ^ (! - "M) 



M 



P„ (taub s < 2e^ N ) > ^ "-f>> (4.40) 

where 1 — v{n) is the probability (I3.4D of a successful single coin-flip and M = c 2 N is 
the number of coins. 

Proof. As the formulation of the Lemma suggests, we use the basic coupling as 
described in the preceding section: Let m' € A5 and 77,0- 6 S n [m/}. Define the 
event B as in (I3.13D . In fact, since we are interested in taub s , the first constraint in 
(I3.13D becomes redundant and we can redefine B simply as 

B = {S <N K }n{N < M} . (4.41) 

Then, performing our basic coupling attempt we infer that, for any 77, a G S\m'], 

P„ {taub s < 2e c ^ N ) > (1 - u{n)) M P CT (r AuBs < e c ^ N ; B) (4.42) 

By Lemma 14.61 and Chebyshev's inequality 

P^, {taub s < 2e c ^) > I (4.43) 
and, in view of the bound ([3~7l9l) , (R~4Ql) follows. □ 
Let us go back to (14.36D . By Lemma 14771 

max K (r AuBs > k2e^ N ) < (l - (1 ~ * . (4.44) 

Therefore, as follows by a straightforward application of the tail formula, 

Is < e~ CsN , (4.45) 

as soon as 

a N T > 3c 8 iVe (c{5)+iy(n))Ar . (4.46) 

Since T ~ e CN with C > being, of course, independent of our choice of 5 and 
n, it is always possible to tune the parameters 5, n and a N j. in such a way that 
(147461) holds. 

STEP 7. Bound on ILj. Note that 

E ^ ( Ta1 {tb s <t a <t b })= E P, (W-K^}) +Ep * ( 1 {rB 5 <r4 E ^)^l{r A <r B} ) ■ 

(4.47) 

By the Uphill Lemma (see (14.241) above) the first term in (14.47D is negligible with 
respect to E Pa ta1| ta<ts 1. Therefore, the bulk of the remaining work is to find an 

appropriate upper bound on the second term in (14.47D . 
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By the Downhill Lemma (see (14.60D below) we would be in good shape if we 
would have the original reversible measure /i instead of the p x eigen-measure de- 
fined in 04. ID . Namely, as it is explained in the end of Subsection I4.3L (14.601) 
implies that, independently of n, there exists c$ > such that 

-L J>(* (i{ TBs <r A }^ B6) r A l {TA<TB} ) < e~ c ° N , (4.48) 

as soon as iV is large enough. However, since we have already established in (14.101) 
that pa is, up to arbitrary small exponential corrections, controlled by /i, it follows 
that the second term in (I4.47D is exponentially small and hence also negligible with 
respect to E Px t a 1 {ta<tb ^. 

The proof of Lemma [4721 is now complete. 

□ 

4.3. Uphill and Downhill Lemmas. In this subsection we shall prove (I4.24D and 
(14T481) . 

Proof, (of Lemma I4.5D Instead of proving A4.24D directly, we will first show the 
(more natural) estimate 

(r B l TB<TA ) < e- cN fi(A), (4.49) 

o-eA 

for some c > 0. To do so, we use the fact that 

E a r AuB = E a {t a 1 ta<tb ) + E a {t b 1 tb<ta ) . (4.50) 

Define the function 

VjlaW = i E ' {TAU * <nh lf<r ^ UB - (4.51) 
I 0, else. 

w AtB solves the Dirichlet problem 

Lw A>B (a) = h A , B (a), a ^ AU B, (4.52) 
w AB (a) = 0, a G A U B, (4.53) 

where L = I — P. Notice that, for a e A, 

K (r A l TA<TB ) = F a {t a < t b ) - Lw A>B {a). (4.54) 
Next, using reversibility, 

^ fx(a)h A<B (a)Lw AjB (a) = ^ fi(a)Lh AjB (a)w AB (a). (4.55) 

(7 (J 

By the properties of the functions h AtB and w AiB , this equation reduces to 

-^2n(a)Lw AB (a) = ^ n(o-)h A:B (a) 2 . (4.56) 

Hence, 

^2n{a)E (r (T A l TA<TB ) = ^2n(a)¥ ir (T A <T B )+ ^ v{<r)h A)B {a) 2 . (4.57) 

o-eA ctgA o-gAuB 
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Using a completely similar procedure, one shows that 

^2 i2(a)E a (t AuB ) = £//(<r) + Yl ^) h A,B^)- (4.58) 

Therefore, taking into account A4.50D , 

J2fi{a)E a {T B l TB<TA ) = J2fi{a)¥ (7 {T B <r A )+ £ K°) h AA°) h BA°)- (4.59) 

The first term on the right is exponentially small compared to fi(A) by Lemma [2721 
The same holds true for the second term, by the same estimates that were used 
in the proof of Lemmata 6.1 and 6.2 in [1]. Thus G4.49D holds. By Lemma l4~4l 
it follows that that for a slightly smaller constant c', E Px t b 1 tb<ta < e~ c ' N . Finally, 
E Px r AuB > 1, and so E Px t a 1 ta<Tb > 1 - e~ c>N , and we can deduce (14.241) . This 
concludes the proof of the lemma. □ 

The microscopic harmonic function h(a) = P (t a < t b ) gives rise to the so called 
/i-transformed chain with transition probabilities p%(a, a') = h(a)~ 1 pN(o; a')h(a'). 
This /i-transformed chain lives on {a : h(a) > 0} and it is revesible with respect to 
fi h = h 2 /j,. The following Downhill Lemma holds : 

Lemma 4.8. With the notation introduced before, 

(l {TBs<TA f <TBs) r A t {TA< r B} ) < A°')K'(tb s <t a ). (4.60) 

o-e A a'eA s \A 

Proof. By reversibility, 
Hence, 

( 1 {t Bs <t a } E *(tb s ) t aMt A <t b }) = Y v(vWv( t a < r Bs )E v (t a 1 {ta<tb} ) . 

(4.62) 

Since the only non-zero contribution to the latter sum comes from r/ in the exte- 
rior boundary of As, we can bound it from above in terms of the /^-transformed 
quantities as 

J2» h (v)V h n (TA<T B5 )E%TA. (4.63) 

V^Bg 

Applying the representation formula (11.41) for hitting times for the /i-transformed 
dynamics, we can represent the above sum as 

£ MK>{t Bs <t a ), (4.64) 

a'eA s \A 



and (147601) follows. □ 



Let us go back to (14.48D . Using an estimate completely analogous to Lemma [2721 
one sees that 

£ MK{TB t <T A )< £ fM(a')h(a')F a ,(T B5 <T A )<n(A 5 \A)e- c * N , 

a'<=A s \A a'£A s \A 

(4.65) 

for some c§ > 0. This allows to deduce (14.481) from 04. 601) . 
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